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The error detecting and correcting codes are used in critical applications like 
in intensive care units, defense applications, and require highly reliable data. 
This brief focuses on codes to detect and correct adjacent errors within a 
single clock cycle by using modulo-2 addition of data bits for parity 
generation, syndrome calculation, error location identification and correction 
by improving code rate and minimizing bit overhead. The optimal parity 
codes devised can correct odd number of adjacent errors upto (N/2)-1 data 
bits when compared with the existing codes with less delay. Four optimal 
codes are proposed using existing decimal matrix codes properties. 
The proposed codes prove better in terms of area, delay, power, bit 
overhead, code rate, code efficiency, and with good reliability. 





Code rate The components are developed in veriloghardware description language and 
Matrix code verified for zynq 7000 series field programmable gate array in xilinx 
Parity bits integrated synthesis environment 14.5 Tool. Among the codes devised, 
optimal code-4 proves to be a better code with 65.3% code rate and 53.12% 
bit overhead. Also when compared with other codes, it uses 33.3% less area 
and 1.89% less power delay product for encoder and 32.2% less area and 
0.36% power delay product for decoder respectively. 
This is an open access article under the CC BY-SA license. 
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1. INTRODUCTION 

In the applications like intensive care units where continuous evaluation and back up of data 
extracted from monitoring devices is required for patients, fast and reliable error detecting and correcting 
codes have attracted research [1]-[4]. Even in military applications, the backup data is usually necessary for 
missile launch and guidance [5]-[8]. This attracted a great attention for reliable codes capable of operating at 
high frequencies [9]-[12]. 

The recent developments and advanced research at various levels of abstraction induced hope of 
using very small area specifically allocated in each embedded memory [13]-[15] with built-in self test 
capability [16]-[19]. But the promising future of electronics at nanoscale allows the devices get closer where the 
devices undergo radiation effects giving rise to multiple event upsets that induce soft errors in memories [20]-[23]. 
For embedded memories, the correction capability must be as high as possible within a single clock cycle to 
ensure at-speed testing [24]-[28]. This paper addresses the correction of maximum number of erroneous bits 
as fast as possible by using minimum number of redundant bits and maximum code rate. 

The basic error detection and correction codes utilize the various blocks as shown in Figure 1. 
The encoder uses two step processes i.e., the horizontal vector hamming code encoding and the discussed 
encoders to generate code word. This code word is written into the memory [29], [30]. Using read operation, 
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the data is decoded first by horizontal vector hamming code decoder and later by parity decoder which verify 
the parity bits for correcting detected errors. The entire process of error detection and correction is based on 
Hamming codes for all the existing or proposed parity codes [29], [30]. 


z Encoder Decoder 
ige Input L Parity >» HVHC >—» Memory —»» HVHC +» Parity -» , Data Output 
—— Encoder Encoder Decoder Decoder — 


Figure 1. Error detection and correction process 


Majorly the changes are incorporated in parity encoder and decoder blocks for error detection and 
correction codes. The additional bits required depends on type of method adopted in encoder on the basis of 
hamming distance which aid in the development of code word that is written into memory. To obtain 
efficient error detection and correction codes, the decoder block is optimized with respect to area, delay, and 
power, this paper proposes four new codes, as matrix codes are quite faster than other codes and are easy to 
incorporate for memory. In this paper, the process represented in Figure 1 is used and the codes either 
existing or proposed are used in the place of parity encoder and decoder blocks only. The paper is structured 
as the accessible codes in section 2 which prove to correct a maximum of 32 adjacent bits but use large 
number of parity bits with less code rate. To overcome this, the proposed error detection and correction codes 
i.e., the proposed solution as four optimal parity codes in section 3 are discussed. The section 4 deals with 
simulation results and comparison in terms of assessed parameters followed by conclusion. 


2. METHOD 

The method adopted is represented in two sections 2.1 as accessible codes which describe the 
mechanism of encoding and decoding adopted for proposed codes. Also the proposed codes are described in 
section 2.2 which are optimised for less number of parity bits. Further the designs are considered for 
evaluating technological parameters such as area, power and delay, nontechnological parameters such as 
code rate and bit overhead. 


2.1. Accessible error detection and correction codes 

In prose, several error detection and correction codes are projected, among them the easiest way of 
correcting errors is done by treating data to be written into memory as matrix and developing the redundant 
bits. Usually the data along with parity bits is written into memory and later extracted from memory by read 
operation [29]. The [2] uses the H, V, and D-Bits i.e., the horizontal, vertical and diagonal redundant bits 
respectively as in 3D parity codes [8]. Figure 2 represents 64-bit data in the form of matrix with 
8 rows x 8 columns giving rise to 8-H, 8-V and 15-D Bits. These parity bits are developed by modulo-2 
additions of data bits read from memory. In the decoder, if an error exists, then the combination of all H, V, 
and D Bits indicate the location where the data bit is corrected by flipping it. Hence usually decoder requires 
a complex circuit when compared to encoder. The D bits are used for location of error exactly in the memory. 
The deviation in H bits decides the existence of errors and the V bits indicate the maximum number of 
adjacent bits that are correctable for the given data by using ultrafast decoding scheme [6], [7] for a 
correction capability of maximum of 8 adjacent bits out of 64-data bits. Say if m29 is erroneous, then it is 
reflected in h4, v6, and d6 bits which deviate from the actual encoded parity bits, by using syndrome bits the 
data are corrected. 




















































































































Figure 2. 3D parity check code (8x8 matrix arrangement) 
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Neelima and Subhas [3], the existing 8x8 matrix representation is changed to 4x16 matrix 
representation by using matrix reordering technique to enable the consideration of 16 bits at a time for 
processing of data. The 4x16 matrix uses 39 redundant bits i.e., 4 H bits, 16 V bits, and 19 D bits. From 
Figure 3, say if m29 is erroneous, then it is reflected in h2, v14 and d4 parity bits which after comparison 
with the encoded parity bits corrected by using syndrome bits, for a correction capability of maximum of 16 
adjacent bits for 64-data bits. 








Figure 4. 3D parity check code (2x32 matrix arrangement) 


In Figure 4, the 2x32 matrix uses 67 parity bits i.e., 2 H bits, 32 V bits and 33D bits for 3D parity 
encoder. Say if m29 is erroneous, then it is reflected in h1, v30, and d3 bits which after comparison with the 
original redundant bits, is flipped for correct data output, for a correction capability of maximum of 32 
adjacent errors for 64-bit data. In [4], a similar concept with H, V, forward and backward diagonal parity bits 
which increases the number of parity bits and requires a complex decoder for identification of candidate bit 
for a correction capability of maximum of 3 adjacent bits for 32-data bits. In [5], multidirectional parity code, 
similar to 3D codes uses syndrome for correction, thus reducing the decoder complexity comparatively for a 
correction capability of maximum of 7 adjacent bits for 32-data bits. In [6], horizontal vertical double 
diagonal parity code requires few refinement steps to be added to identify the candidate bits but still adds 
complexity in decoding process due to additional refinement steps, for a correction capability of maximum of 
3 adjacent bits for 32-data bits. 


2.2. Optimal error detection and correction codes 

The matrix codes can be optimized by using minimal number of parity bits for a 2x32 matrix 
representation. This algorithm uses the same process of encoding and decoding as that of decimal matrix 
code. The algorithms use vector adjustment, S={S,32’b0} for most significant bit data correction and 
S={32’b0,S} for least significant bit data correction, for a correction capability of maximum of 31 adjacent 
bits out of 64 data bits. 


Inputs: Enable, H, V, dataread 
Output: dataout 
If Enable = 1 then 

HD[8:0] = dataread[7: 0] + dataread [23:16] 

HD[17:9] = dataread[15:8] + dataread[31:24] 
HD[26:18] = dataread[39: 32] + dataread[55: 48] 
HD([35: 27] = dataread[47: 41] + dataread[63: 56] 

VD; = dataread; @ dataread;432 where i =0,1,...,31 


Inputs: Enable, datain 
Outputs: H, V 
If Enable = 1 then 
H[8:0] = datain[7:0] + datain[23: 16] 


H[17:9] = datain[15:8] + datain[31: 24] 
H[26:18] = datain[39:32] + datain[55:48] 
H[35: 27] = datain[47:41] + datain[63:56] 

V; = datain; @ datain;432 where i = 0,1, ...,31 
Else 
H + 0 and V=0 


(a) 


Syndrome Calculation 
Hdiff = HD @ H 
S - VD @ V 
Error Location and Correction 
If Hdiff = 0 and S = 0 then 
dataout = dataread 
Else 
dataout = dataread @ S 
Else 
dataout = 0 


(b) 


Figure 5. Optimal parity code-1: (a) encoder and (b) decoder 
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The optimal parity code-1 is developed based on the decimal matrix based code except that the 
redundant memory details are not used for error correction as shown in Figure 5, where Figure 5(a) is 
encoder and Figure 5(b) is decoder. The total numbers of redundant bits are 36+32-68 bits for 64-bits of 
data. In encoder, the H-bits are calculated by addition and vertical bits are calculated by xoring the input data 
bits, which are given as inputs to decoder. The encoder reuse technique is used to calculate the decoder 
horizontal decoded and vertical decoded-bits respectively. Then they are xored with the actual encoded 
horizontal and vertical bits for syndrome. If difference in horizontal bits and syndrome are zero, then the 
output data bits are same as that of data read from the memory else the erroneous bits are corrected by 
modulo-2operations on data read from the memory and syndrome. This method can correct odd number of 
adjacent errors i.e., upto (N/2)-1in N-bit data read from memory. So here as S has only 32 bits only either 
half of data bits can be corrected and the other half of the bits remain unchanged. For example, consider the 
datain = 64’b0 then outputs H and V will be 36’b0 and 32’b0 respectively. In decoder say: 


The dataread= 64’b1111111111111111111111111111111000000000000000000000000000000000 
i.e., 31 bits of error in most significant bit resulting in a large variation of value read from memory.Then, 
Horizontal decoded bit = 36’b000000000000000100000000000000000000 
and 
Vertical decoded bit = 32’b L111 1111111 111111111111111111110. 
As H = 36’b0 and V = 32’b0, then 
Hdiff = 36’b000000000000000100000000000000000000 
and 
S= {32’b 11111111111111111111111111111110, 32’b0} 


then dataout = dataread@® S = 64’b0. 

Similarly, the optimal parity code-2 as shown in Figure 6, where Figure 6(a) is encoder and Figure 6(b) 
is decoder. Figure 6 is based on modification of optimal parity code-1 except that the change is in the number 
of H-bits and its calculation by 16-bit addition. Here 34 horizontal bits and 32 vertical bits are required i.e., 
66 parity bits are required for 64-bits of data. For example, consider the datain = 64’b0 then outputs H and V 
will be 34’b0 and 32’b0 respectively. In decoder say: 


The dataread= 64’b1111111111111111111111111111111000000000000000000000000000000000 


i.e., 31 bits of error in most-significant bit (MSB) resulting in a large variation of value read from memory. 
Then, 


Horizontal decoded bit = 34’b00000000000000 10000000000000000000 
and 

Vertical decoded bit = 32’b 11111111111111111111111111111110. 
As H = 34’b0 and V = 32’b0, then 

Hdiff = 34’b0000000000000010000000000000000000 
and 

S = {32’b 11111111111111111111111111111110, 32’b0} 


then dataout = dataread@® S = 64’b0. 

The optimal parity code-3 as shown in Figure 7, where Figure 7(a) is encoder and Figure 7(b) is 
decoder. Figure 7 uses the hamming code parity for horizontal bits and modulo-2 addition for vertical bits 
which reduces the complexity of adder. As per the hamming parity scheme, for each horizontal 
representation of matrix 32-bits require 6-bits of parity bits to obtain hamming code as: 

—  H(0)=xor (0, 1, 3, 4, 6, 8, 10, 11, 13, 15, 17, 19, 21, 23, 25, 26, 28, 30), 
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—  H(1) - xor (0, 2, 3, 5, 6, 9, 10, 12, 13, 16, 17, 20, 21, 24, 25, 27, 28, 31), 

-  H(2)=xor (1, 2,3, 7, 8, 9. 10, 14, 15, 16, 17, 22, 23, 24, 25, 29, 30, 31), 

—  H(3) - xor (4, 5, 6, 7, 8,9, 10, 18, 19, 20, 21, 22, 23, 24, 25), 

— H(4 - xor (11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25), and 

-  H(5)=xor (26, 27, 28, 29, 30, 31). 

So, a total of 12 horizontal bits and 32 vertical bits are required for parity i.e., 44 parity bits for 64-bits of data. 


H[16:0] = datain[15:0] + datain[{31:16] HD[16:0] = dataread[15:0] + dataread[31:16] 
H[33:17] = datain[47:32] + datain[63:48] HD[33:17] = dataread[47:32] + dataread[63:48] 
(a) (b) 


Figure 6. Optimal parity code-2: (a) encoder and (b) decoder 


H[0] = datain[0] @ datain[1] @ ...@ datain[28] @ datain[30] HD[0] = dataread[0] @ dataread[1] @ ...@ dataread[28] @ dataread [30] 
H[1] = datain[0] @ datain[2] @ ...@ datain[28] @ datain[31] HD[1] = dataread[0] @ dataread[2] @ ...@ dataread[28] ® dataread [31] 
H[2] = datain[1] @ datain[2] @ ...@ datain[30] @ datain[31] HD[2] = dataread[1] @ dataread[2] @ ...@ dataread[30] @ dataread [31] 
H[3] = datain[4] @ datain[5] @ ...@ datain[24] @ datain[25] HD[3] = dataread[4] @ dataread[5] @ ...@ dataread[24] @ dataread [25] 
H[4] = datain[11] @ datain[12] @ ...@ datain[24] ® datain[25] HD[4] = dataread[11] @ dataread[12] @ ...@ dataread[24] @ dataread[25] 
H[5] = datain[26] @ datain[27] @ ... ® datain[30] @ datain[31] HD[5] = dataread[26] @ dataread[27] @ ...@ dataread[30] @ dataread[31] 


H[6] = datain[32] @ datain[33] @ ...@ datain[60] @ datain[62] HD[6] = dataread[32] @ dataread[33] @ ...@ dataread[60] @ dataread[62] 
H[7] = datain[32] ® datain[34] @ ...@ datain[60] © datain[63] HD[7] = dataread[32] @ dataread[34] @ ...@ dataread[60] @ dataread[63] 
H[8] = datain[33] @ datain[34] @ ...@ datain[62] @ datain[63] HD[8] - dataread[33] @ dataread[34] @ ...@ dataread[62] @ dataread[63] 
H[9] = datain[36] @ datain[37] @ ...@ datain[56] @ datain[57] HD[9] = dataread[36] ® dataread[37] @ ...@ dataread[56] @ dataread[57] 
H[10] - datain[43] @ datain[44] @ ...@ datain[56] @ datain[57] HD[10] = dataread[43] ® dataread[44] @ ...@ dataread[56] @ dataread[57] 
H[11] = datain[58] @ datain[59] @ ...@ datain[62] @ datain[63] HD[11] = dataread [58] @ dataread[59] @ ...@ dataread[62] @ dataread [63] 


(a) (b) 
Figure 7. Optimal parity code-3: (a) encoder and (b) decoder 

For example, consider the datain = 64’b0 then outputs H and V will be 12’b0 and 32’b0 
respectively. Indecoder say: 

The dataread= 64’b1111111111111111111111111111111000000000000000000000000000000000 
i.e., 31 bits of error in MSB resulting in a large variation of value read from memory. Then, 

Horizontal decoded bit = 12’b011011000000 
and 

Vertical decoded bit = 32'b11111111111111111111111111111110. 
As H = 12’b0 and V = 32’b0, then 

Hdiff= 12’b011011000000 
and 

S= {32’b 11111111111111111111111111111110, 32’b0} 


then dataout = dataread@ S = 64’b0. 


H[0] = datain[0] @ datain[1] @ ... ® datain[30] @ datain[31] HD[0] = dataread(0] @ dataread[1] @ ... ® dataread[30] @ dataread[31] 
H[1] = datain[32] @ datain[33] @ ...@ datain[62] @ datain[63] HD[1] = dataread[32] @ dataread [33] @ ...@ dataread[62] @ dataread[63] 


(a) (b) 
Figure 8. Optimal parity code-4: (a) encoder and (b) decoder 
The optimal parity code-4 as shown in Figure 8, where Figure 8(a) is encoder and Figure 8(b) is 
decoder. Figure 8 is based on the observation that horizontal bits indicate the error and vertical bits indicate 


the syndrome used for correction, the horizontal parity bits are replaced by only two N/2-bit XOR operation 
for each row of matrix as shown in algorithm below. So, a total of 2 horizontal bits and 32 vertical bits are 
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required for parity i.e., 34 bits for 64-data bits. Hence, the OPC-4 improves the code rate. For example, 
consider the datain = 64’b0 then outputs H and V will be 2”b0 and 32’b0 respectively. In decoder say: 


The dataread = 64’b1111111111111111111111111111111000000000000000000000000000000000 
i.e., 31 bits of error in MSB resulting in a large variation of value read from memory. Then, 
Horizontal decoded bit = 2’b10 
and 
Vertical decoded bit = 32’b1 11111111 11111111111111111111110. 
As H = 2’b0 and V = 32’b0, then 
Hdiff = 2’b10 
and 
S= {32’b 11111111111111111111111111111110, 32’b0} 


Then dataout = dataread@® S = 64’b0. 


3. RESULTS AND DISCUSSION 

The verilog hardware description languageis used to model codes, zynq field programmable gate array 
(28 nm) with part number XC7Z100-1FFG1 156 in Xilinx Integrated Synthesis Environment 14.5 tool is used to 
verify them. The Figure 9 emphasizes simulation result of proposed optimal parity codes with the signals 
represented as dcorrected, en, dread, h and v in the order of visualization with sizes of 64, 1, 64, 2, 32 bits 
respectively. Here the original data is considered to be 64’b0, if the dread is 64’°h000000007 fffffff i.e., 
31 adjacent bits in least significant bit of dread, then all bits are corrected giving rise to correction capability 
of 31 adjacent bits but if more number of adjacent bits are found to be erroneous, then these codes fail to 
correct. Hence the correction capability is 51.56% erroneous bits in 64-bit data read from memory by using 
the proposed optimal parity codes. 


Œ] File Edit View Simulation Window Layout Help 
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Figure 9. Integrated simulator result 


As the focus is to improve the code rate for better efficiency of error detection and correction codes 
with fewer bits overhead. The same is proved from Figure 10, i.e., if maximum code rate is required, then 
horizontal vertical double diagonal code has 70.3% code rate and 42.1% of minimum bit overhead that 
compromises reliability. But if reliability is a concern, then among the codes capable of correcting (N/2)-1 
adjacent bits, optimal code-4 is best suited as it has 65.306% code rate and only 53.125% of bit overhead. 

The area is evaluated in terms of number of slice look-up tables occupied by the codes out of available 
554800 slice look-up tables in zynq field programmable gate array with part number XC7Z100-1FFG1156 for 
their corresponding encoder and decoder. As per the requirement, the area overhead and delay should be kept at 
minimum for a good code. The same is proved for the field programmable gate array based code design in the 
Figure 11 and Figure 12. 

The Figure 11 shows the comparison of existing and proposed codes in terms of slice look-up tables 
area. From Figure 11, among all the codes, optimal parity code-4 occupies minimum area of 30 slice 
look-uptables while the other existing and proposed codes occupy a minimum of 38 to a maximum of 98 
slice look-up tables i.e., the area utilization is reduced by atleast 21.05% to 69.38% for encoder. Similarly, 
the optimal parity codes 3 and 4 occupy minimum area of 80 slice look-up tables while the other existing and 
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proposed codes occupy a minimum of 118 to a maximum of 175 slice look-up tables i.e., the area utilization 


is reduced by atleast 32.2% to 54.28% for decoder. 


From 


671.4 pWs i.e., 


applications. 


Area (#Slice LUTs out of 554800) 


Figure 12, among all the codes, optimal parity code-4 utilises power delay product of 
435.74 pWs while the other existing and proposed codes use a minimum of 403.44 pWs to a maximum of 
the power delay product is reduced by atleast -8% to 35.09% for encoder. Similarly, the 
optimal parity code-4 utilisespower delay product of 736.28 pWs while the other existing and proposed codes 
use a minimum of 763.28 pWs to a maximum of 1115.86 pWs i.e., the power delay product is reduced by 
atleast 3.53% to 34.01% for decoder. Hence the optimal code-4 proves to be best among the codes mentioned 
for error detection and correction especially when large number of bits get damaged and need repair from 
memory or equivalent device within very less time. These are well suited for image or video processing 
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Figure 12. Codes comparison for power delay product 
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4. CONCLUSION 

The embedded memories used in highly sensitive applications are usually error prone devices due to 
congestion at nanometer technologies exposed to radiation effects. During read operation operating at rated 
clock frequency multiple bit upsets happen and they exhibit as soft errors, then the error detection and 
correction codes are used to overcome them. The proposed codes are aimed to increase the code rate by using 
fewer bits overhead. These are capable of correcting at most (N/2)-1 adjacent errors in N bit data. The verilog 
hardware description languagecoded designs are verified in Xilinx tool for 28nm zynq field programmable gate 
array. From results obtained, among all the codes mentioned, the optimal code-4 proves to be efficient with 
65.3% code rate and only 53.125% bit overhead, atleast 21.05% and 32.2% less area utilization for encoder 
and decoder respectively. Also it uses atleast 3.53% and 34.01% less power delay product for its encoder and 
decoder respectively. Further these codes can incorporate low power techniques for faster operation and 
optimized area. 
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